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, . Abstract 

Detailed information about the weight distribution of a convolutional code is given by 
the adjacency matrix of the state diagram associated with a controller canonical form of 
the code. We will show that this matrix is an invariant of the code. Moreover, it will 
be proven that codes with the same adjacency matrix have the same dimension and the 
same Forney indices and finally that for one-dimensional binary convolutional codes the 
^ I adjacency matrix determines the code uniquely up to monomial equivalence. 
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The weight distribution of a code forms an important parameter containing a lot of informa- 
tion about the code for practical as well as theoretical purposes. Quite some research in block 
code theory has been devoted to the investigation of the weight distribution and to weight 
5^ , preserving maps. The most famous results in this area are certainly the Duality Theorem 

of Mac Williams which presents a transformation between the weight distributions of a given 
code and its dual as well as MacWilliams' Equivalence Theorem stating that two isometric 
codes are monomially equivalent. 

In this paper we will address these topics for convolutional codes. In |21| it has been 
shown that there cannot exist a duality theorem for the weight distribution of convolutional 
codes and their duals, if the weight distribution is defined to be enumerating the atomic 
codewords. The authors present two binary one-dimensional codes having the same weight 
distribution while their duals have different ones. We will present this example at the end of 
Section[51 It indicates that the thus defined weight distribution contains too little information 
about the code when it comes to duality. In this paper we will concentrate on a different type 
of weight distribution containing considerably more information. This is the adjacency matrix 
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of the state diagram associated with the controller canonical form of a minimal encoder. Since 
for block codes this matrix reduces to the usual weight distribution, it can be regarded as 
a generalization of the latter to convolutional codes. Then the following issues arise. First 
of all, it needs to be clarified whether this matrix is an invariant of the code, i. e., does 
not depend on the choice of the minimal encoder. Since at any rate the adjacency matrix 
depends on the chosen ordering on the state space, it is reasonable to factor out the effect 
of the ordering. This brings us to the notion of generalized adjacency matrix for which 
then we can answer the question above in the affirmative in Section^ Secondly and more 
interestingly, we will approach the problem of how much information about the code does the 
generalized adjacency matrix contain. More precisely, what is the relation between two codes 
sharing the same generalized adjacency matrix? In Section [S] we will first show that two such 
codes always have the same Forney indices. Thereafter, we will restrict ourselves to the most 
important class of convolutional codes from a practical point of view, the one-dimensional 
binary codes. With the aid of MacWilliams' Equivalence Theorem for block codes we will 
show that such codes share the same generalized adjacency matrix if and only if they are 
monomially equivalent. One should bear in mind that this result cannot be expected for 
general codes, since it is not even true for higher-dimensional binary block codes. However, 
as a consequence we obtain that if two binary one-dimensional convolutional codes share the 
same generalized adjacency matrix then so do their dual codes. This shows in particular 
that the example in [2^ mentioned above does not apply if we consider the generalized 
adjacency matrix rather than the weight distribution of the atomic codewords. The result 
just mentioned tempts us to conjecture that two codes have the same generalized adjacency 
matrix if and only if this is true for the dual codes, or, more explicitly, that there might be a 
MacWilliams duality for the generalized adjacency matrices of convolutional codes. Indeed, 
in jTj in a totally different form a MacWilliams tranformation rule has been established for 
convolutional codes of the smallest kind, that is, with overall constraint length being one. 
We will present this transformation in (|5.2)) at the end of this paper. Proving or disproving 
the conjecture for general codes, however, appears to be quite a difficult problem and has to 
remain open for future research. 

In the next two sections we will introduce the material as necessary for deriving our 
results. We will discuss the controller canonical form of an encoder matrix and the associated 
state diagram along with its adjacency matrix. This will in particular bring us to a state- 
space description of convolutional codes as introduced first by Massey and Sain ^2j and 
discussed in many other papers, see, e. g., [HE]. Since the results as we will need them 
later on are somewhat spread over the literature and proofs are not always easily accessible, 
we think it is worthwhile presenting the material of the next two sections as self-contained as 
possible, even though most of the results are not new. Furthermore, as opposed to the existing 
literature we will give a purely matrix theoretic approach. Following McEliece [Tl] we will 
introduce the notions of atomic and molecular codewords and show how the corresponding 
weight distribution can be derived, theoretically, from the adjacency matrix of the state 
diagram. Other methods of computing the weight distribution are Viterbi's method, see [H] 
or |H1 Sec. 3.10], or Mason's gain formula as described in Sec. 10.2]. For us McEliece's 
approach has the advantage to be fully in matrix theoretic terms. This way we will obtain 
a consistent presentation throughout the whole paper which also bears the hope that we can 
bring this interesting topic closer to the mathematical community. Moreover, his method 
leads us directly to the adjacency matrix which will be the central object of our studies in 
the last two sections. At the end of Section|31 we will also briefly discuss two types of distance 
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parameters which have been introduced in a totahy different context for convolutional codes, 
the extended row distances and the active burst distances. Both are cfosely related to the 
notions of atomic and molecular codewords and, as we will see, therefore also appear in the 
weight distribution. Sections |1] and [5] contain the new results as described in the previous 
paragraph. We will close the paper with some open problems in Sectional 

We end the introduction with presenting the basic notions of convolutional coding theory. 
Throughout the paper the symbol F stands for any finite field while ¥q always denotes a field 
with q elements. The ring of polynomials and the field of formal Laurent series over F are 
given by ¥[z] = { Zf=o fj^' | G No, /, € F} and F((z)) = { ZT=l fj^' | / e Z, G F}. The 
following definition of a convolutional code is standard. 

Definition 1.1 Let F = Fg. A convolutional code with parameters (n, k, 5)q is a /c-dimensional 
subspace C of the vector space F((z))" of the form C = imG := {uG \ u G ¥{{zY} where G 
is a matrix in F[z]'^^"' that is basic, i. e. there exists some matrix G G F[z]"'^'^ such that 
GG = Ik, and satisfies 5 = max{deg7 | 7 is a A:-minor of G}. We call G a generator matrix 
or encoder and S the overall constraint length of the code C. 

Notice that, by definition, a generator matrix is always polynomial and has a polynomial 
right inverse. This implies that in the situation of Definition 11.11 the polynomial codewords 
belong to polynomial messages, i. e. Cpoi := CnF[z]" = {uG I u G ¥[zf}. In other words, the 
generator matrix is delay-free and non-catastrophic. As a consequence, a convolutional code 
is always uniquely determined by its polynomial part. Precisely, if C = imG and C = imG' 
where G, G' G F[z]'^^" are both basic, then 

C = C ^ C n ¥[zf = C'r\ ¥[zf ^G' = UG for some U G Glk{¥{z\). (1.1) 

This also shows that the overall constraint length of a code does not depend on the choice of 
the generator matrix. 

It is well-known Thm. 5] or p. 495] that each code has a minimal generator matrix 
in the sense of the next definition and that such a matrix can be obtained constructively from 
a given generator matrix, see 13 Sec. 4]. For a polynomial vector v G ¥[z\'^ we define degv 
to be the maximum degree of its entries and, as usual, we put degO = —00. 

Definition 1.2 Let G G F[2:]'^^"' be a basic matrix with overall constraint length 5 and let 
vi, . . . ,Vk be the degrees of the rows of G. We say that G is minimal ii 5 = X]i=i ^i- ^^is 
case, the row degrees of G are uniquely determined by the code C := imG C F((z))"'. They 
are called the Forney indices of C. The maximal Forney index is called the memory of the 
code. 

From the above it follows that a convolutional code with parameters (n, k, 6) has a constant 
generator matrix if and only if 5 = 0. In that case the code can be regarded as an (n, k) 
block code. 

The definition of weight and distance in convolutional coding theory is straightforward. 
For a polynomial vector v G F[z]" with deg(f ) = N and for j = 0, . . . , N we define vj G F" to 
be the vector coefficient of z^ in v. Then the weight of v is defined to be wt{v) = Y^f=o wt(t>j) 
where wt{vj) denotes the Hamming weight of vj G F". The (free) distance of a code C C 
F((2))" is given as dist(C) := min {wt(u) | v G Cpoi, w 7^ O}. 
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2 Controller canonical form and state diagram 



In this section we introduce the matrix representation of the controller canonical form for 
a given encoder matrix G. The interpretation of this form has been discussed in detail in 
01 Sec. II], IHj, and |H[ Sec. 2.1]. The relation to the encoding process will be made clear 
below. The results of this section are in essence well-known from the references above and 
other coding literature. However, since we were not able to find detailed references including 
proofs for all results we think it is worthwhile to summarize them with strict matrix theoretical 
proofs. We will make heavy use of these results in later sections. 



Definition 2.1 Let G = (gij) £ F[z] be a generator matrix with row degrees 71, . . . ,7fe 
and let gij = Yl2=o di'j^ ■^'^ ■ 7 ~ Si=i 7« ^^'^ asume 7 > 0. For i = 1, . . . , define 
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as well as -D = (5^^^) £ ip'^^xn^ case 7i = for some i, the corresponding block is missing 
and in 5 a zero row occurs. We call (A, B,C,D) the controller canonical form of the code C = 
imG. 

Let us first investigate the properties of G in terms of its controller canonical form. 

Lemma 2.2 Let G, A, B, G, and D be as in Definition \2.1l Then 

(1) G = B{z~^I - A)-^C + D = zB{I - zA)-^G + D. 

(2) G is minimal in the sense of Definition \1.2\ if and only if rank [A, C] = 7. 



Proof: (1) It is easy to see that 
/z z" ••• 



771 



B{z-^1-A) 



-1 



z z 



yl2 



\ 



z z 



(2.11 



where a zero row occurs if 7^ = 0. From this and the definition of all matrices involved we 



obtain B<yZ-^l - A)-^C 



v-7« Sy) V 



i = l,...,fc 
j = l,...,n 



G — D. This shows the first identity 



of (1). If 7i = 0, then the i-th row of B{z — A) ^G is zero, and the assertion is correct, 
too. The second equality follows easily from the first one. 



4 



(2) Using the fact that G is minimal if and only if the highest coefficient matrix {gjj^ ■ G F'^^" 
has rank k (see j^^ p. 495]), this part follows directly from the definition of A and C . □ 

It is well-known that the encoding process of the convolutional code generated by G can 
be described by a linear shift register. The matrix version of this is exactly the controller 
canonical form along with the corresponding dynamical system as given in part (1) of the 
next theorem. We will give the interpretation right after the proof. 

Theorem 2.3 Let G G F[2;]'^^" he a generator matrix and A, B, G, and D be as in Def- 
inition UHl Let u = J2t>o'^tz^ G ^ii^))'' ^^d. v = X]t>o^*-^* ^ ^((-^))" ^^^Z define x = 
uB[z-^I - A)-^ = zuB{I- zA)-^ £¥{{z)y. 
(1) Then x = X^^i xtz^ (i. e., xq = 0) and 



xG + uD j \ vt = XfG + UfD 
(2) Ifu = {u^^\ . . . , u^''^) e F[z]*'', i. e. u is polynomial, then x £ ¥[z]'^ and 

deg X = max {ji + deg u^*^ \ i = 1, . . . ,k, ji ^ O}. 
Moreover, if G is minimal, then 

deg(nG) = max + degu^*^ \ i = 1, . . . , A;}. 

We call xt € F'*' as in (1 ) the state of the encoder at time t given that the input is u as above. 
The space F"^ is called the state space of the encoder G. 



The state-space realization xt+i = xtA + utB, vt = XfG + utD has been introduced in [T^ 
and has also been discussed in [HE]- It is different, however, from the state-space system 
used in 1201 El [T7| . In those papers the codeword is made up by the combined input and 
output, while in our case the codeword coincides with the output of the system. 

Proof: (1) From Lemma (212^1) we have v = uG ■^=> v = u{B{z~^I - Ay^C + D) and 
using the definition for x the first equivalence follows. The other one follows by equating like 
powers of z. 

(2) From ()2.1|1 we have x = {u^^^z, . . . ,u^^^z"''^, . . . ,u'^^^z, . . . ,u^^^z'~''') where for 7j = the 
corresponding block is missing. From this the first assertion follows. The second one is one 
of the well-known characterizations of minimal matrices, see p. 495]. □ 

Notice that if all row indices 7^ are nonzero, then the matrix B has full row rank and 
deg a; > degu. If 7^ = for some i, then keri? 7^ and the inequality degx < degu might 
occur. In any case one has degx < deg(wG). 

Obviously the dynamical equations xt+i = XfA + UfB, vt = XfC + utD describe the 
input-state-output behavior of the canonical shift register realization of the encoder G. The 
inputs at time t are given by the sequence ut, the state vectors xt G represent the contents 
of the memory elements of the register at time t and vt is the output at that time. Part (1) 
above tells us in particular xq = 0, which is the usual assumption that the shift register is 
empty at the beginning of the encoding process. Part (2) shows that if the input sequence is 
finite, then the memory is finite, too. It is zero after a certain number of steps depending on 
the length of the k different memory series and on when the entering input sequence is zero. 
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Example 2.4 Let q = 16 and 



G 



a + az + z'^ + az + a^^z^ a^^ + az + ot'z^ 



l + z 



a-^° + a^z 



GFi6[z] 



2x3 



where + a + 1 = 0. Then the multiphcation 

uG = Y,utz'G = Y,{ul'\ul'^)z'-G = Y,'^'^'^ -^'^ ^'(^^^^^ 



t>0 



t>0 



ut - vt 

is reahzed by the following linear shift register, shown at time t. 



E 



VtZ = V 




The controller canonical form is given by 
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all 



The dynamical equations xt+i = xtA + utB^ vt = xtG + utD with the state at time t being 
xt '■= (n[ j*^ , u| j*2 ) ^1-1 ) describe exactly the input-state-output behavior of the shift register. 

From now on we will always assume that the generator matrix G G F[z]*''^" of the 
code C C F((z))" is minimal with Forney indices 71, . . . , 7^ and overall constraint length 7 > 0. 
Moreover, we define the matrices A, B, C, and D as in Definition 12.11 

Lemma 2.5 Let u € F[z]'^ and v := uG € Cpoi- Assume v 7^ and let degv = N > 0. Put 
X = uB{z^^I — A)^^ G F[2]'^. Choose L G {1, . . . , N}. Then the following are equivalent. 

(i) XL = 0, 

(a) V = V + v where v, v & Cpo}\{0} and degv < L, v G z^F[z]". 
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This lemma simply states that if the shift register is back to the zero state (at time t = L), 
then one may regard the information before and after that time instance as two separate 
information messages and the associated codewords as two separate codewords. 

Proof: Recall that degx < degv = N. 

"(i) =^ (ii)" : Put X = Y^^Sq xtz^ and x = "^^^=1+1 ^tz^- li L = N, put x = 0. Then x = x + x 
and uB = x{z~^I — A) + x{z~^I — A). Writing u = u + u where degtt < L and u S z'^F[2:]'^, we 
obtain deg(u-B) < L and uB G z-^F[z]'''. Therefore, uB = x{z~^I — A) and uB = x{z~^I — A). 
Now put V :=uG = u{B{z~^I - Ay^C + D) = xC + uD and v ■= uG = xC + uD. Then it 
is easy to see that (ii) is satisfied. 

"(ii) =^ (i)": Let v = uG and v = uG, hence v = {u + u)G. Then basicness of G implies 
u G z^F[z]'^. Moreover, since G is a minimal, we have 

deg-u = maxjdeg tt^*-' + 7j | i = 1, . . . , A;}, 

where u = {v}^^\ ■ ■ ■ ,u^^^), see jSj p. 495]. Thus the assumption degi; < L implies degn*^*) < 
L - 7j for alH = 1, . . . , k. Now, x = uB{z~^I - A)~^ + uB{z~'^I - A)^^ and from ((TT|) we 
obtain that deg{uB{z-^I - A)-'^) < L and uB{z-^I - A)-^ G z^+^F[z]'^. Thus xz, = 0. □ 

The above gives rise to the distinction of codewords into those which are the sum of 
two non overlapping codewords and those which are not. For counting weights it will be 
advantageous to make an even finer distinction. We will introduce this only for polynomial 
codewords. The generalization to infinite codewords is obvious. 

Definition 2.6 Let C = imG and v G Cpoi such that vq 7^ 0. Let L G N. 

(a) The codeword v is called concatenated at time t = L ii 

V = V + V where v, v £ Cpoi\{0}, degi; = L — 1, u G z^F[2;]"'. 

If additionally, vl ^ 0, we call v tightly concatenated at time t = L. 

(b) We call the codeword concatenated if it is concatenated at some time instance t = L. We 
call it tightly concatenated if each of its concatenations is tight. 

(c) If V is not concatenated, then v is called atomic. 

(d) If V is tightly concatenated or atomic, then v is also called molecular. 

Parts of this definition can also be found in Several comments are in order. First of 

all, we consider only polynomial codewords that start at time t = 0, i. e., vq ^ 0. This is 
certainly no restriction when it comes to computing the weight. Secondly, it is obvious that 
each such codeword is the concatenation of atomic codewords. Thirdly, from Lemma 12.51 we 
know that if v is concatenated at time t = L then the state at time t = L satisfies xl = 0. 
In this case, the dynamical equations in Theorem 12. .Sf l) show that v is tightly concatenated 
at time t = L if and only if ul 7^ (since the matrix D is right invertible). Thus for 
a non-tightly concatenated codeword the shift register is zero and the input is zero for at 
least one time instance before nonzero input is entering again. For a tightly concatenated 
codeword the shift register is zero, and there is immediately nonzero input being fed into 
the system. If v is concatenated at time t = L, but not tightly concatenated, then we have 
XL = xl+1 = 0. However, if v is tightly concatenated at time t = L, then it also might 
happen that xl = xl+i = 0. This is because the matrix B might have a nontrivial kernel. 
All this is best visualised by using the state diagram. It will be advantageous to define it in 
such a way that it only captures the molecular codewords. 
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Definition 2.7 

(a) Let s := and write F''' = {Xq, Xi, . . . , X_i} in an arbitrary ordering such that Xq = 0. 
We define F''' to be the state space of the encoder G and the state diagram of G as the 
labeled directed graph given by the vertex set {Xq,Xi, . . . , X-i} and the set of edges 

^Xi^^Xj M e F^, ?; G F" : Xj = X^A + uB,v = XiG + uD, (X, u) / (0, 0)} 

(b) A path of length I is a sequence of edges of the form 

X,,_iV> Xi,^^ '-^i^Xi^J^^Xi^. (2.2) 

(c) The path 1)2.2^ is called a cyc/e around Xi^ if Xj,, = X; • 

(d) The weight of the path H2.2|) is defined as X^[=o ^t(?;j). 

(e) Let u, X, v := uG be as in Theorem 12.31 If f G C\Cpoi we call the infinite path 

(iifl.) (iiL) (ii2.) 

— > T-i A — ^ To ^5 — ^ 



= Xo Xi '"-^ X2 X3 



the path associated with the codeword v = uG. In case v G Cpoi and degv = X, we have 
xn+1 = (since degx < degf) and we call the finite path 

(iiiL) (iiL) (iia.) (iijv ) 

= Xq ^ Xi ^ X2 ^ X3 Xat X7V+I = U 

the cycle around zero associated with the codeword v = uG. 



Note that the edges of the state diagram correspond to the transitions in the canonical shift 

(— ) 

register: Xj — - — >Xj is an edge if for some time instance t the memory vector is given by Xj 
the input is u, and this leads to the next memory vector Xj and the output v. The only 
exception is the state Xq = together with the input u = 0. This leads to the next state 

Xj = XqA + uB = and v = XqC + uD = and this transition (or the edge — ^^0) is 
not included in the state diagram. Hence there emerge edges at each vertex except for the 
vertex Xq = at which only q'' — 1 edges emerge. 

As a consequence, the state diagram contains all information about the encoding process. 
The cycles around Xq = correspond to the molecular codewords in imG (notice that the 

edge — corresponds to the situation xl = and ul = occurring only for concatenated, 
but not tightly concatenated, codewords). The message sequence determines the path through 
the graph and the corresponding ii-labels yield the associated codeword. Note also that it 
is possible to have two different edges between the same vertices, i. e. edges of the form 
(— ) (-^) 

Xj — - — > Xj and Xi — - — > Xj where u ^ u'. This happens if and only if 7; = for at least 
one I as can easily be seen from the matrix B. In this and only this case there are also edges 
of the form — - — >0 in the state diagram such that u G keri? is not zero. 

Since the state space of Example 12.41 has 16^ elements, we are not able to explicitly 
display the associated state diagram. We will rather restrict ourselves to the following smaller 
encoder. 
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Example 2.8 Let G = {I + z + z"^ + z^, 1 + + z^) € ¥2[z]^''^. Thus, n = 2, k = 1, and 
7 = 7i = 3. The state diagram has s = 2^ = 8 vertices. We obtain the matrices 

/I 0\ 

5= (1 0) , C = I 1 ij , D = (1 1) . 

Going through all options for the equations xt+i = XfA + utB, vt = XfC + UfD yields the 
state diagram 





The following lemma will be useful for the investigation of cycles of weight zero below. 
Lemma 2.9 There does not exist a cycle of the form 



-^io ^^ii ^^12 ^ ^^i; — -^io 



in the state diagram. 



Proof: Suppose we have such a cycle. Then Xi^ = Xi^A, Xi^ = Xi^A^ and eventually 
= Xi^Ai. Running possibly several times through this cycle we may assume without 
restriction that I is bigger than the nilpotency index of A. Thus = and hence Xi^ = 0. 

(") 

^ ) does not belong to 

□ 



But then also = 0, which is a contradiction since the edge 0- 
the state diagram. 



It is possible to characterize basicness, thus non-catastrophicity, of the encoder matrix 
in terms of the state diagram. As opposed to the main line of this section we do not make 
any assumptions on the matrix G in the following proposition. The controller canonical form 
and the state diagram can be defined exactly the same way as for basic polynomial matrices. 
Recall from matrix theory that a matrix G G F[2;]'^^" is basic if and only if rankG(A) = k for 
all A in an algebraic closure of F. 

Proposition 2.10 Let ¥ be an algebraic closure of F and let G G F[2;]'^^"' have rank k. 
Consider the associated state diagram. Then 

( ^ ) 

(1) rankG(O) < k <^=^ tiiere exists an edge — ^^—>-X for some u G F'=\{0} and X G F^. 

(2) rankG(A) < k for some A G F\{0} <^=> there exists a cycle of weight zero. 



Proof: (1) is clear since G(0) = D. 

(2) : It is well-known that the assumption implies the existence of some u = J2t>o ^ 
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F[zf \F[z]^ such that v = uG = Y.t=o ^tz^ ^ ^\^T^ hence = for t > L, see (131 Thm. 6.3]. 
Let X = uB{z~^I — A)~^ = Ylt>o ^^e corresponding state sequence. Hence each xt is 

a vertex of the state diagram. Since the state diagram has only finitely many edges, but the 
information sequence has infinitely many nonzero blocks ut, there have to be some t2 > ti > L 
and a vertex X such that xt^ = X = xt2- Since tij = for t > L this gives us a cycle of 
weight zero. 

"<^=" : Suppose we have a cycle of weight zero and length / in the state diagram, say 



X. 



«0 



Xi, 



( — tJ~' 



X., 



y — D — ), 



H-l 



Xi, 



where = Xi^ =: X. By Lemma 12.91 not all Uj are zero. There exists a path from the state 
Xq = to the state X, say 



^32 



3T 



X. 



This can be easily seen since the existence of such a path is equivalent to the existence of 
(no, • • • , ut-i) such that 

(BA^-^\ 

X = {uq, . . . ,UT-l) 



V 



B 



J 



which in turn follows from the full column rank of the right hand side matrix if we choose T 
big enough. Put u := J2\zl utz* and u = Y^^~^ utz* + Y.Zo ^-2^"^*' ^ H^t- Then u ¥[z]'', 
since u 7^ 0, but v = uG = ^1^=0 ^t-^* ^ F[z]". By ^1 Thm. 6.3] this implies the left hand 
side of (2). □ 



3 The adjacency matrix and the weight distribution 

In this section we will show how to compute the weight distribution of a convolutional code 
in terms of the state diagram. A main tool will be the adjacency matrix associated with the 
state diagram as it has been defined in |14j . In slightly different forms this matrix appears 
also in other papers on convolutional codes, see for instance IHl Sec. 3.10]. Theorem 13.81 has 
been derived in Jl]. First of all we need a reasonable definition for the weight distribution. 
Of course, it is sufficient to count only the weights of atomic codewords. In order to do so, 
we need to show that we are dealing with finite numbers if counted appropriately. This will 
be dealt with in the first lemma. 

General Assumption 3.1 From now on we will always assume that the generator matrix 
G € F[z]'^^" of the code C C F((z))" is minimal with Forney indices 71, ■ ■ ■ ,"fk and overall 
constraint length 7 > 0. Moreover, we define the matrices A, B, C, and D as in Definition l2.11 
Finally, let s := q"' and F'^ = {Xq = 0, Xi, . . . , Xg-i} be the vertices of the state diagram in 
some fixed ordering. 

Lemma 3.2 Assume the memory of G, i. e., the maximal Forney index, is given by m. 

(a) Let V = uG G Cpoj where u = J2i=o'^i^^ ^ ¥[z]^ with uq ^ ^ ul- If ui^i = u/+2 = 
. . . = ui+m = for some / E {0, . . . , L — m — 1}, then v is concatenated. 
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(b) For all a € No we have ^{v € Cpoi \ v atomic, wt{v) < a} < oo. 

Proof: (a) Write u = J2''i=o'^i^^ + X]f=z+m+i ^i-^* =: u + u. Then deg('uG) < I + m and 

iiG € 2;'^™^^F[z]"', thus v = uG + uG is concatenated at some time t < I + m + \. 

(b) Let G € ¥[2]"^*"' be a right inverse of G and let G have maximal row degree m. Suppose 

V = uG G Cpoi is a codeword with at least m + m consecutive zero coefficients. Thus, v = v + v 
where v, v € F[z]"\{0} satisfy degv < L and v G z^+m+m+iY^^jn ^^j. ^^^^ ^ ^ pj^^ Then 
u = uGG = vG + vG and part (a) shows that v is not atomic. All this proves that atomic 
codewords do not have more than m + m — 1 consecutive zero coefficients. As a consequence, 
all atomic codewords of weight at most a have a degree bounded by some € N proving 
the assertion. □ 

Remark 3.3 It is not hard to prove that if all Forney indices of G are equal to m then part (a) 
above becomes an if-and-only-if statement. Indeed, let v = uG G Cpoi be concatenated as 

V = V + V where v, v ^ Cpoi\{0} and degv < T and v G z-^+^F[z]" for some T < degu — 1. 
We have to show that ut = ut-i = ■ ■ ■ = UT-m+i = 0. From Lemma 12.51 we know that 
xt+1 = 0, thus we obtain recursively 

= xtA + utB = XT-iA^ + ut-iBA + utB = ... 

(3.1) 



XT-m+lA^ + {uT-m+1, ■ ■ ■ ,Ut) 



BA 



It follows directly from the definition of the controller canonical form that if all Forney indices 
are m, then A^ = and the matrix on the very right of (|3.H) is a non-singular 7 x 7-matrix. 
Hence H3.1|) yields the desired result. 

The above makes the computation of the distance of a given code to a finite problem, at least 
theoretically. 

Definition 3.4 For all a, Z G N define 

^i,a '■= £ Cpoi I V atomic, degu = / — 1, wt(f) = a}. 

CO 00 

The power series = L) ■=^^ ^i^aW^L^ e QlW, L] is called the weight distribution 

1=1 a=l 

of the code C. 



Observe that the weight distribution is an invariant of the code and does not depend 
on a chosen generator matrix. The definition above is somewhat unusual as it omits the 
constant term 1 representing the zero codeword. We will see later that our definition is more 
convenient. Note that the numbers ivi^a are indeed finite. Moreover, since each codeword of 
degree I — 1 has at most I nonzero coefficients in F", we have X^^i ^i,aW^ = X^^Li ^i,aW^i 
thus 

CO nl 

^ = EE^'."^"^' ^^[^IW- (3.2) 

1=1 a=l 
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Notice also that for block codes we simply have = Yl^a=i ^i,aW°'L. This is, up to the factor 
L, the ordinary weight distribution for block codes where the constant term 1, representing 
the zero codeword, has been omitted. In the general case we also have for each a G N that 
Yl'^o^i,aL^ is a finite sum, due to Lemma lc{.2r b). Hence 

oo oo 

^ = EE^'."^'^"^'Q[^HW^1- (3.3) 

a=l 1=0 

Finally observe that 

dist(C) = min |a G N 3 / G N : / o| 

is the degree of the smallest term ocurring in the series expansion with respect to W. 

In the sequel we will show how one can compute, at least theoretically, the weight dis- 
tribution of a code using the state diagram. All necessary information is contained in the 
following matrix A. Recall General Assumption 13.11 For ease of notation rows and columns 
of s X s-matrices will always be indexed with i, j = 0, . . . , s — 1. 

Definition 3.5 For all G {0, . . . , s — 1}^ and all a G {0, ... , n} define 



\j - 



0, if (i,j,a) = (0,0,0), 

#{n G I Xj = XiA + uB, wt{XiC + uD) = a}, else. 



Furthermore, define the matrix 

A:=[^AgV"] GQ[W]^^^ 

\a=0 / ij=0,...,s-l 

We call A the adjacency matrix of the encoder matrix G (or of the state diagram) . 

Notice that A,-" is the number of all edges in the state diagram of the form Xi — - — > Xj where 

wt(u) = Q. Obviously the weight is bounded by n. The identity Aq'^q = reflects the fact 

that there is no edge of the form — in the state diagram. One should also observe that 
the adjacency matrix depends on the choice of the encoder G, thus on the matrices j4, B^ C, 
and D as well as on the numbering of the states. We will discuss this issue in Section 0] in 
detail. 

Example 3.6 Let F = F2. 

(1) Let G = f i*, ^ , ° I G FM2X3. Thus 7 = 1 and s = 2 and A = (o) , B = 
yOz + lzy ^ ' 

, C = (0 1 1) , D = J Jj^ . With this one obtains straightforwardly 
2W^ W + 



A 
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(2) In Example 12.81 we have s = 8 and obtain, with an appropriate ordering of the states, 
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The following proposition will lead us to a computation of the weight distribution via 
the adjacency matrix A. 

Proposition 3.7 Let / G N. For € {0, . . . ,s — 1}^ denote the entry at the position 

ofA^ by 

nl 

Then xf'"^ is the number of paths in the state diagram from Xi to Xj of length I and weight a. 
In particular, Xqq^ = for all I € N. 

Note that it is clear that the entries in A' are all polynomials of degree at most nl. 

Proof: We proceed by induction on I. For I = 1 the assertion is just the definition of the 
adjacency matrix A. Assume the assertion is true for / > 1. Then 

s—l s—l n nl 

n{l+l) s-1 a 

= E EE^f'^r''^'' 

a=0 u=0(3=0 

By induction hypothesis X^^'"' is the number of paths from X^ to Xj of length / and weight 

a — (3 and likewise A^^^^^^ is the number of edges from Xi to X^, and weight /?. Since all paths 
from Xi to Xj go through exactly one state Xi, after one step, we obtain that the total number 

of paths from Xi to Xj of length / + 1 and weight a is given by '^tZ}oYl'p=o ^i^u^^ ^uf 
This proves the first assertion. The second one is clear since there are no cycles of weight 
zero in the state diagram according to Proposition 12. 1()T 2). □ 

Now we are in a position to present the main result about the weight distribution. Recall 
that only atomic codewords are counted. According to Lemma 12.51 thev correspond to cycles 
around the state Xq = in the state diagram that do not pass through this state in the 
meantime. The number Xq'q^ counts all cycles around Xq = 0, but does not take into 

account whether it passes through this state in the meantime. Therefore, Xq'q^ is the number 
of molecular codewords of length I and weight a (this will also become clear from the proof 
below). The next theorem shows how to obtain from this the weight distribution O. For the 
sake of completeness we also add the proof as it can be found in [TI] . 
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Theorem 3.8 (McEliece HH Thm. 3.1]) Let ^> := 1+E£i Ea=i ^0,0 £ Q[W^]M- 

Then 

(a) <I> = ((/ — LA)^^)qq G Q(W^,L), the entry in the upper left corner of the rational 
matrix {I - LA)-^ € Q{W, L^'. 

(b) n = i- $-1. 

This is also comphant with the block code case in which A = E^gc\{o} W'^^^'"^ G Q[W^] and 

n = AL. 

Proof: (a) is seen via 

00 00 00 

* = 1 + J:(A%,L' = (/ + T.(AL)') = {E(AL)') = ((/ - AL)-') . 

1=1 1=1 ' /=0 

(b) Recall from Equation (jHIS) that Q = J2Zi YTa=i ooi^aW'^lK We first show that for r > 1 
the coefficient of W^L^ in the power fi'' is the number of all codewords of weight a and 
degree / — 1 that consist of r tightly concatenated atomic codewords. In other words, it is the 
number of all cycles around Xq = in the state diagram that have length / and weight a, 
and that pass through the zero state exactly r — 1 times except for start and endpoint. This 
can be proven by induction on r. Precisely, let = Yl'i^i X^oLi i^/ ^l^"-^') then 

/-I a-l 
(r+1) (r) 

l'=l/3=l 

and is the number of all atomic cycles around Xq = of length /' and weight /? while 

(r) 

^i-i' a-p induction, the number of all cycles around Xq = of length / — / and 

weight a — 13 that pass exactly r — 1 through Xq = except for start and endpoint. Since 
each cycle consisting of r tightly concatenated atomic cycles can be obtained in a unique 
way by tightly concatenating these types of cycles we obtain the desired assertion about the 
coefficients of Q"^ . 
Using Proposition 13.71 we conclude 

^ = 1 + ^ + ^'^ + + ... = S^vt'' = — - 

^ 1-0 

and this yields $7 = 1- (^^^ . □ 

Observe that the proof shows indeed that $ is the weight distribution of the molecular 
codewords. 

If one follows the arguments above one has to perform the following steps in order to 
compute the weight distribution of a given code: 

(1) Compute A. 

(2) Solve the equation (/ - LA)x = (1,0,..., Of for x G Q{W, Lf . 

(3) Then $ = xi and SI = 1 - 

While (1) and (3) are easily done for reasonably sized parameters, step (2) quickly becomes 
unpractical with growing overall constraint length 7 and/or field size q since the size of the 
adjacency matrix is x q^ . Better algorithms for computing the weight distribution while 
avoiding the big adjacency matrix can be found in [2l I15| ITfi]. 
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Example 3.9 Consider F = F2 and G := [l + z + z"^ + 1 + + z^). The matrix is 
basic and, of course, minimal. In this case the adjacency matrix is only 8x8 and we can 
perform the computation of the weight distribution along the steps (1) ~ (3) above by using, 
for instance. Maple. We computed the adjacency matrix A already in Example I3.6f 2). From 
that one obtains the weight distribution 

n = l'^w^{l+w-lw^)/{i-lw-l'^w+l^w'^-l^w^-lV-l^w'^+l'^w'^) 

+0{W^°). 

Thus, the distance is 6 and the code contains exactly one atomic codeword of weight 6, it 
has length 5. It contains three atomic codewords (of length 4, 6, and 7, respectively) of 
weight 7 and 5 atomic codewords of weight 8, two of which have length 9 the other three 
have length 6, 7, 8, respectively, etc. 

At the end of this section we want to briefly mention two distance parameters appearing 
in a totally different context in the literature that are closely related to our notions. In [01 
p. 541] the extended row distances dj are defined. The definition shows immediately that 

(i[ = min{wt(t>) | v atomic, deg(f) = /} = min{a G Nq | <^i+i,Q 7^ 0} 

where toj^a are given in Definition 13.41 Thus these distance parameters can be recovered, at 
least theoretically, from the weight distribution 17. There are also other definitions of the 
extended row distances in the literature. They are slightly different and in general not that 
closely related to atomic or molecular codewords and some of them even depend on the choice 
of the (minimal) encoder. 

In [BJ p. 155] the /th order active burst distance of the code C = imG is defined as 

:= min{wt((tiG)[Q_^]) | = and (xj, Xj+i, Uj) / for all < i < /}, 

where, as usual, x is the associated state sequence, see also Sec. 3.2]. Moreover, {uG)[qj] 
denotes the codeword truncated after the Ith power of z. As we will show now, if G is minimal 
then 

= min{wt(f) | v molecular, deg ?; = 1} = min{a S N [ ^^q^^'"'^ 7^ 0} (3.4) 

where xf'"^ are defined as in Proposition 13.71 The second identity follows directly from the 
discussion right before Theorem 13.81 As for the first identity, note first that the condition 
{xi, Xi+i,Ui) 7^ simply means that the codeword uG is molecular. Moreover, Lemma 12.51 
along with = implies that v := (nG)[o,/] is a codeword. Finally we have degv = I, 
which can be seen as follows. Suppose degv < I. Since G is minimal we have degn < degv, 
see Theorem 12.3r 2). Hence ui = = vi. Using the controller canonical form (A, B ,C, D) 
we obtain = = xiA and u / = = xiG, and Lemma 12.21^ 2) implies xi = 0. Hence 
{xi,xi^i,ui) = which contradicts the choice of v. All this together shows that w is a 
molecular codeword of degree /. Conversely one can easily see that each such codeword has 
a state sequence x such that (xi,Xj+i,nj) 7^ for all < I. This proves the first identity 
of (|3.4|) . Hence the active burst distances occur in the series ^> as used in Theorem 13.81 for 
enumerating the molecular codewords. In particular we have that min;>o equals the free 
distance of the code, see also |H1 Thm. 3.8]. 
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4 The adjacency matrix as an invariant of the code 



In this section we will prove that the adjacency matrix is an invariant of the code, i. e., does 
not depend on the choice of the minimal generator matrix, provided that one factors out the 
effect of the (arbitrarily chosen) ordering of the states. As to our knowledge the result of this 
and the next section are new. 

Throughout this section let the data be as in General Assumption 13.11 It is clear from 
Definition IH. 51 that the adjacency matrix of G depends on the ordering of the states. Suppose 
now we have fixed two different orderings on the state space F'^, each one satisfying Xq = 0. 
The we obtain two (different) adjacency matrices Ac and A^ and it is clear that 

A^ = IIAgII^^ for some permutation matrix 11 E G/s(Q) such that IIo^o = 1- (4.1) 

Again, we will always index the rows and columns of matrices in Q[iy]*^* by i, j = 0, . . . , s— 1. 
We define 

Q := {n [ n € Gls{Q) is a permutation matrix and IIo^o = 1} 
to be the group of these specific permutation matrices and the action 

OxQiwY""' — >q[wY''', (n,A)i — .nAn-^ 

This yields a group action on Q[H^]^^* and we obtain the quotient space Q[M^]*^*/^ of all 
equivalence classes 

A := {nAn~^ I n G a g q[i^]"^". 

All this gives us a well-defined mapping G i — > Aq from the set of all minimal generator 
matrices with overall constraint length 7 into Q[VF]*^*/(^ by simply choosing Ac as the 
adjacency matrix of G with respect to any arbitrary ordering of the state space such that 
Xq = 0. We will show now that A^ is even an invariant of the code. Indeed, we have 

Theorem 4.1 Let G, G' G F[z]'^^" be two minimal generator matrices such thatC := imG = 
imC. Then 

A^ = A^. (4.2) 

Thus, the adjacency matrices of G and G' differ only via conjugation hy some matrix in Q. 
We will write A{C) := Aq for this invariant and call it the generalized adjacency matrix of 
the code. 



The theorem tells us that the adjacency matrix is, up to the group action of ^, an invariant 
of the code. In other words, the generalized adjacency matrix is a well-defined mapping 

A : {C C Fg((z))" I C code with overah constraint length 7} — > Q[W]'^^'"^^ /g 

C ^ A(C) ^^'^^ 

Proof: Let i/i,...,z^fc and fii,...,fik be the row degrees of G and G', respectively. By 
assumption and (jl.lj) we have 

G' = UG for some matrix U G Gk{¥[z]) (4.4) 

and we also have {vi, . . . , u^} = {fJ-i, . . . , ^k}- Let 7 := J2i=i — Yli=i f^i be the overall 
constraint length of the code. 
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Since every unimodular matrix U G G/fc(F[z]) is the product of elementary matrices, we may 
show the result for each type of elementary transformation separately. In the rest of the proof 
we fix an ordering on the state space such that Xq = 0. Moreover we define (vl, -B, C, D) and 
(A' , B' ,C' , D') to be the controller canonical forms of G and G' , respectively. 
1) We show that a permutation of the rows of G results in a conjugation of Aq just like 
in (|4.1|) . Thus let us assume G' = UG where U permutes the ith. and jth row of G. Then A' 
is obtained from A by permuting the ith and jth block row and column, B' is obtained from B 
by permuting the ith and jth row and the ith and jth block column, G' is obtained from G 
by permuting the ith and jth block row and finally D' is obtained from D by permuting the 
ith and jth row. Thus, there exists a permutation matrix P G Gl^{¥) such that 



A' = PAP-^, B' = UBP 



-1 



(this is also correct if or is zero 



G' = PG, D' 

> Ar, 



UD 



gram of G, thus (Ac);,™ 
this we obtain X^P^^ = 



Now let Xi ' " ' > Xjn be an edge in the state dia- 
{XiP-^)A' + {uU-^)B' and v = {XiP-^)C' + {uU-^)D' , hence 



XiP 



-) 



X'VYiP 



is an edge in the state diagram of G'. Since the mapping P : F'^ ^ F'^ 
is simply a permutation of the states with XqP = Xq, we arrive at nAcII^^ = An' for some 



LQ, we arrive ai iiivcii = ivq' 
suitable permutation H (z Q. This in turn implies (|4.2|) . 

2) Next we consider the case where the matrix U in (|4.4j) multiplies the rows of G with some 
nonzero constants, say U = diag(tii, . . . ,Uk) € G/fc(F). Then 



A' = A,B' = B, G' = UG, D' = UD 



where 



/nil, 



U 



^1 



U2I, 



V2 



\ 



Here Ij denotes the j x j-identity matrix. It is easy to see that U A = AU and UB = BU, 
thus A' = A = UAU~^ and B' = B = UBU~^. Using the same arguments as in case 1) we 
arrive at nAcII"^ = Aq/ for some 11 G ^. 

3) Now we consider the case where U adds a constant multiple of one row to another. Because 
of part 1) of this proof we may assume I'l = fJ-i < ■ ■ ■ < I'k = fJ-k- Since we did already part 2) 
of this proof and since G and G' = UG are both minimal we only have to consider the case 
where the jth row is added to the ith row and j < i. Hence U = 1^ + E where E G f'^^'^ has 
a 1 at position (i, j) and elsewhere. Let us first assume i^j > 0. Put 



U 



M 



G Gl-y(¥) 



(4.5) 



where 



M 
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Then we have 

A' = A, B' = B, C = UC, D' = UD. 

Furthermore, it is easy to see that MAj = AiM and BiM = Bj where Ai, Bi are the diagonal 
blocks of the matrices A, B as given in Definition 12.11 From this we obtain UA = AU and 
UB = BU. Now we can use the same arguments as in case 2) to finish the proof. If = 
we have A' = A, B' = B, C = C, and D' = UD. Using U := and the fact that the jth 
row of B is zero, we have again UB = BU and we can argue as before. 

4) Finally we have to consider the case where the matrix U adds a non constant multiple of 
one row of G to another. Without restriction we may assume that times the jth row is 
added to the ith row. Since G' is supposed to be minimal again, we have I < Vi — Vj. Hence 
U = Ik + E where E € ^^^^ has the entry at position and elsewhere. Let again 
first Vj > 0. Consider U as in (|4.5j) but where M now is of the form 



M= \ eF^»^^^ 




Then we obtain 

A' = A, B' = B, C' = UG + ED, D' = D 

where E G F'*'^'^ has a 1 at position (r, j) with r = X]r=i ^t + ' and elsewhere. Furthermore, 
AiM = MAj + N where iV G F'^'^^'j has a 1 at position (/,1) and elsewhere. Thus 
All = If A + N where N € F'''^'^ satisfies Nr^t = 1 with r as above and t = v^- + 1 and 

all other entries are zero. Moreover, one has EB = N, since Vj > 0, as well as BU = B 

(— ) 

since / > 0. Suppose now that X — - — > X' is an edge in the state diagram of G. Then 
X' = XA + uB and v = XG + uD. One computes 

X'Lf-^ = {XU-^)A' + (u - Xtj~^E)B' and v = {Xty~^)C' + (u - XU-^E)D'. 

Thus, putting u = u — XU^^E, we obtain that XU^^ — - — >X'U^^ is an edge of the state 
diagram of G'. Again U^^ simply permutes the states and we obtain XIAgII"^ = Aq' for 
a suitable permutation 11. In the case Vj = we have A' = A, B' = B, C = C + ED, 
and D' = D where E is as before. Since the jth row of B is zero, one has EB = 
and thus the equations X' = XA + uB, v = XC + uD are equivalent to the equations 
X' = XA' + (n — XE)B', v = XC + {u — XE)D' and we can argue as above. This completes 
the proof. □ 

Notice that the proof also shows how the controller canonical form changes under uni- 
modular transformations of the minimal generator matrix. However, we will not need that 
result explicitly. 

Next we will briefiy turn to monomially equivalent codes. Defining monomial equiva- 
lence just like for block codes it is straightforward to show that it preserves the generalized 
adjacency matrices, see Theorem 14.31 below. In the next section we will see that for certain 
classes of codes even the converse of that theorem is true. 

Definition 4.2 Two matrices G, G' G F[z]'^^"' are called monomially equivalent if G' = 
GPR for some permutation matrix P G G/„(F) and a non-singular diagonal matrix R G 
G/„(F). Thus, G and G' are monomially equivalent if and only if they differ by a permutation 
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and a rescaling of the columns. We call two codes C, C C ¥{(z))^ monomially equivalent and 
write C ^ C if C = iuiG and C = im G' for some monomially equivalent generator matrices. 
Furthermore, we write 

[C] := {C C F((z))" I C ~ C} 
for the monomial equivalence class of the code C C F((z))". 

It is clear that monomially equivalent codes have the same distance. As we show next they 
even have the same generalized adjacency matrix. 

Theorem 4.3 Let C, C C F((z))" he two convolutional codes. Then 

C^C' ^ A(C) = A(C'). 

Proof: Let G, G' G Ffz]'^^" be generator matrices of C and C, respectively, such that 
G' = GPR for some permutation matrix P and a nonsingular constant diagonal matrix R. 
Then the controller canonical forms satisfy A' = A, B' = B, G' = GPR,D' = DPR. Thus, 
is an edge in the state diagram associated with G then X — X' is an edge 
in the state diagram of G' . Since wt(f) = wt{vPR) we obtain Aq = A^/ if we fix for both 
codes the same ordering on the state space. This implies the desired result. □ 

As a consequence we have a well-defined mapping 

A : {[C] I C C F((z))" code with overall constraint length 7} — > Q[Wf^ g 

[C] ^ A(C) ^^-^^ 

In the next section we will discuss some properties of the mappings A and A. 



5 The adjacency matrix as a complete invariant for one-dimen- 
sional binary codes 

In this section we will derive some results about the generalized adjacency matrix. The 
guiding question is as to what properties do codes share if they have the same generalized 
adjacency matrix. We will first show that such codes always have the same Forney indices. 
Secondly, we will show that binary one-dimensional codes with the same generalized adjacency 
matrix are monomially equivalent. We conjecture that this is also true for one-dimensional 
codes over arbitrary fields. However, it is certainly not true for general higher-dimensional 
codes as we know from (binary) block code theory, see [71 Exa. 1.6.1]. As a consequence, we 
obtain that if two binary one-dimensional codes share the same generalized adjacency matrix, 
then so do their duals. This indicates the existence of a Mac Williams duality theorem for the 
adjacency matrices of convolutional codes, and, indeed, such a theorem has been proven for 
codes with overall constraint length one in the paper PP, see (|5.2() at the end of this section. 
The general case has to remain open for future research. 

We begin with showing that two codes sharing the same generalized adjacency matrix 
have the same Forney indices. Observe that two such codes certainly have the same overall 
constraint length since that determines the size of the adjacency matrix. 
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Theorem 5.1 Let C, C C ¥([z))^ be two convolutional codes with the same overaU constraint 
length and such that A(C) = A(C'). Then C and C have the same dimension and, up to 
ordering, the same Forney indices. 

For the proof we will need the following lemma. 

Lemma 5.2 Let the data he as in General Assumption \3.1i Let Aq be the adjacency matrix 
of G with respect to some fixed ordering on the state space such that Xq = 0. Put 
T := Ag + -Eo,o where -Eq.o € Q*^** is the matrix with 1 at position (0,0) and elsewhere. 
Define 

( B \ 

BA 

Pr = rank 

for r S Nq. Then for r > 1 the number of nonzero entries in the first row ofT^' is given by 

nPr-l 



is 



The matrix T is the adjacency matrix of the state diagram where also the edge 
included. We call this the extended state diagram. 

Proof: Just like in the proof of Proposition 123 one shows that r[^- is the weight enumerator 
of all paths from Xi to Xj of length exactly r in the extended state diagram. Thus our 
assertion is equivalent to saying that there are exactly qP^-^ states that can be reached from 
Xq = by a path of length exactly r in the extended state diagram. This is obviously true 
for r = 1 since the existence of an edge 



(— ) 



that Xj = uB and there are q^^^ 
path 

(- 
0— 



Xj is equivalent to the existence of u such 
qP^ different Xj possible. In general, the existence of a 



X 



31 



(-^) 



is equivalent to the existence of ui , . . . , such that 



3r 



Xjr = iUr,Ur-l, . . . 



BA 



XBA^-'J 



showing that this allows for qP^-^ different states X 



3r- 



□ 



Now it is not hard to prove the theorem above. 

Proof of Theorem 15. 11 Let G G F[2:]'^^" and G' G F[z]'^'^"' be minimal generator matrices 
of C and C' and {A, B ,C, D) and {A' , B' ,C' , D') the controller canonical forms, respectively. 
Put r = Ag+Eq^ and F' = AQ/ + EQfi where Eq^ is as in the previous lemma. By assumption 
nAcII"^ = Aq' for some 11 G ^. Then also nFFI"^ = F', since IIo^o = 1- Thus the first rows 
of F and F' coincide up to ordering. From this we obtain k = k' . Indeed, the first row of F 
contains the weight of all edges emerging from the zero state in the extended state diagram 
associated with G. Since there are q'^ such edges we have l^j=o = Z]o=o '^aW°' where 



E 



a=0 ' 



q^ . On the other hand Ej=o ^0,^' = E?=o -'^0,7 ^'^'^ using the same argument this 
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yields k = k' . As for the Forney indices we proceed as follows. We have nr^TI^^ = (T'Y for 
all r G N. Thus, due to the form of 11, the matrices and F'^ have the same number of 
nonzero entries in the first row. Therefore, Lemma 15.21 implies = p'^ for all r € Nq where 



rank 



BA 
\BA') 



I 

Pr 



rank 



( B' \ 
B'A' 

\B'A'^) 



Now let 7i, . . . , 7fc and 7^, . . . , 7^ be the Forney indices of G and G' , respectively. By definition 
of A and B we have 

Pq = TwkB = ^{i I 7i > 0} and ranki^A*" = ^{i | 7i > t} for all r G N. 

Moreover, due to the specific form of the matrices, 



rank 



BA 



yBA^-^j 



+ rankS^''. 



Therefore, 



Pr - Pr-i = #{« I 7i > ^'l for r G N. 
Analogous identities hold true for the other code. Using now p,. = for all r G No it follows 

#{i I 7j > r} = #{i I 7. > r} for all r G Nq. 

This shows that the Forney indices coincide up to ordering. □ 

Now we come to the main result of this section. The proof will make use of the Equiv- 
alence Theorem of MacWilliams about weight preserving transformations for block codes. 
Moreover, a technical lemma for bijections on will be proven in the appendix. Even 
though the lemma does not hold true for arbitrary fields we strongly believe that the follow- 
ing theorem is also valid for one-dimensional codes over bigger fields. 

Theorem 5.3 Let C, C C ¥2{{z))"' be two binary one-dimensional codes such that A(C) = 
A(C'). Then C and C are mononiially equivalent. 



Notice that the result implies that the mapping A in (|4.H|) restricted to binary one-dimensional 
codes is bijective. One should also observe that the theorem above is not true for higher- 
dimensional binary codes since it even fails for binary block codes, see (3 Exa. 1.6.1]. 

Proof: Let F = F2 and G, G' G F[z]^^" be minimal generator matrices of C and C , re- 
spectively. By assumption C and C have the same overall constraint length, say 7. Without 
loss of generality we may assume 7 > 1. Fix an ordering on the state space F''' such that 
Xq = and let Ac, A^/ be the adjacency matrices associated with G and G'. By assumption 
nAc/II"^ = Ag for some n G ^. If 7 = 1 then Q = {I2}, thus H = I2. Next we will show 
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that also in the case 7 > 2 we obtain 11 = I21 ■ In order to do so, let 5*27 be the symmetric 
group on the set {0, 1, . . . , 2"^ — 1} and define vr G 5*27 to be the permutation such that 



n 



/ ^7r(0) \ 
677(1) 

\e,r(27-l)/ 



, n 



-1 



(^I(O)' ^I(l)' • • • ' '^1(27-1))' 



where eo,--- ,627-1 are the standard basis vectors in Q^^. Then '7r(0) = 0. The controller 
canonical forms of G and G' are given by {A, B, G, D) and (^4, B, C , D') where 



A 



( 1 



V 



\ 



g]p7X7^ S = (1,0,... ,0) € 



and C, D, C", D' are defined as in Definition 12.11 We have (Ag')^ .,- = if and only if there 

Xj is an edge in the state diagram of G and wt(t') = a. 



exists n £ F such that X,- 



{— ) 



In the rest of the proof we will use for X = (xi. 



notation X\ 



a,b] 



G F"^ and 1 < a < 6 < 7 the 

(— ) 

{xa, ■ ■ ■ ,Xb). Then the existence of the edge Xi — - — >Xj is equivalent to 



Xj = (n, and, in particular, u is uniquely determined by Xj. On the other hand 

all this is equivalent to {■^-G')TT{i),Tr{j) = (II^G'II~^)i,i = (^G)i,j = W"', hence to the existence 

(-^) 

of an edge — - — > X^(^j^ such that wt(f') = a in the state diagram of G'. Likewise we 
have X^(^j-j = («', (X7r(j)) [1.7-1])- Denote by vr : F''' — > the permutation on F'*' such that 
■7r(Xj) = for alH = 0, . . . , 2"^ — 1. Then ^(0) = and the above gives us the following 

property for the permutation vr: 



[2,7] 



[1,7-1] 



^(^)[2,7] = HXha-i] for all X,Ye¥^ 



In Lemma lA.ll in the appendix we show that this implies tt = id. Thus 11 
consequence we have Ac = for all 7 > 1. From the equivalence 

iAG)i,j = ly" ^ wtiXiG + Xj^iD) = a 

and the corresponding equivalence for (AQ')ij = W°' we finally arrive at 



'27 • 



As a 



wt {Xi, u) 



D 



But then Lemma 15.41 below yields 



wt {Xi, u) 



G' 



G' 
D' 



for all Xi G F'^, u G F. 



for some permutation matrix P G G/„(F). Hence G' = GP meaning that the two matrices 
are monomially equivalent. □ 



It remains to prove 
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Lemma 5.4 Let ¥ be any finite field and let M, M' G F^^" be such that 

wt(uM) = wt(nM') for all u G F*^. (5.1) 
Then M and M' are mononiially equivalent. 

Proof: ^ Let us first assume that M has rank k. Then the assumption (|5.1|) impHes that M' 
has rank k, too. Defining the block codes B = imM and B' = imM' we obtain a well-defined 
weight-preserving bijective linear transformation 

B — > B', uM I — > uM'. 

By virtue of the Equivalence Theorem of MacWilliams, see for instance [3 Thm. 7.9.4] the 
two block codes are monomially equivalent. Thus there exist a permutation matrix P and a 
non-singular diagonal matrix R in G/„(F) such that M' = MPR. 

Let now rankM = r < k and assume without loss of gener ality that M = f^^A where 



V 

Ml G F**^" has full row rank. Then {0,U2)M = for all U2 G F''"'' and (I^TTj) impfies that 

M' = (^^'1^^^ for some M[ G F''''" with full row rank. Now we have wt(iiiMi) = wt(niM{) 

for all ui G F** and by the first part of this proof Mi and M[ are monomially equivalent. But 
then the same is true for M and M' . □ 

We close the section with briefly discussing the question whether there might exist a 
MacWilliams duality theorem for convolutional codes. For block codes this famous theorem 
states that the weight distribution of the dual code is fully determined by the weight distri- 
bution of the original code and a transformation formula is given, see, e. g., Thm. 3.5.3]. 
For convolutional codes one might think of two possible generalizations of this result, ei- 
ther to the weight distribution 0, or to the adjacency matrix A. As we will describe next, 
both cases have already been touched upon in the literature. In [2^ it has been shown 
that there does not exist a MacWilliams duality theorem for the weight distribution Q 
of convolutional codes. Precisely, the following example has been presented. Consider 
Gi = [1, z, 1 + z], G2 = [z, z,l + z] G F^^3. Then one shows that the weight distribu- 
tions of the two codes Ci := imGi and C2 = imG2 coincide. Indeed, they are both given by 
n = jt^/^- The dual codes are given by 

Ci = im ^ ^ , Co = im (, , ^ 
and it turns out that they have different weight distributions 



5 



^^1^ " l-LW-LW^ ' ~ 1-LW-LW^ ■ 

As a consequence there cannot exist a MacWilliams transformation mapping the weight 
distribution of a given code onto the weight distribution of the dual without using any further 
information. The example even shows more. Since multiplication by z is weight-preserving, 
the mapping uG 1 — > uG' yields an isometry between the codes C and C . But obviously, 
the codes are not monomially equivalent, showing that there is no MacWilliams Equivalence 



^One can prove this result straightforwardly for the field F2. However, I wish to thank Gert Schneider for 
pointing out the connection to MacWilliams' Equivalence Theorem to me. 
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Theorem for convolutional codes in this form (one would have to ahow at least rescaling by 
powers of z in monomial equivalence). Let us now discuss the adjacency matrices of these 
codes. Since the two codes are not monomially equivalent we know from Theorem I5.|-il that 
the generalized adjacency matrices of the two codes are not identical. Indeed, one computes 

/ / W\ 

Of course, the generalized adjacency matrices of the dual codes are different since the weight 
distributions are. They are given by 

^ ~\W + W'^ W + W^J ' 2 - [2W^ W + W^J ' 

At this point the question arises whether there exists a Mac Williams duality theorem for the 
adjacency matrices of convolutional codes. Indeed, in the paper |^ such a transformation 
has been established for codes with overall constraint length one. It is derived in totally 
different notation, but for codes with parameters (n, k, 1)2 it amounts after some rewriting 
to the formula 

= 2-''-\l + WrM^\i^ where M = (] \ V f 1 \ V (5-2) 
i+w \l —ij \l —LJ 

and where M\a denotes substitution of a for W in every entry of the matrix M and T = 
A + £^0,0 is the adjacency matrix of the extended state diagram (see also Lemma l5.2() . The 
formula can straightforwardly be verified for the two codes and their duals given above. 

We strongly believe that such a transformation also exists for codes with bigger overall 
constraint length. At least in the one-dimensional binary case with arbitrary overall constraint 
length we can establish the following support for this conjecture. 

Corollary 5.5 Let C, C Q F2((-2;))" be two binary one-dimensional codes. Then 

A(C) = A(C') =^ A(C^) = A(C'^). 

Proof: By virtue of Theorem 15.31 the assumption implies that C and C are monomially 
equivalent. It is trivial to see that then also C"*" and C'^ are monomially equivalent and thus 
Theorem lO yields A (C-^ ) = A (C^ ) . □ 

Unfortunately, the corollary does not reveal a formula transforming A(C) into A(C"'"). 

We close the section with the following 
Conjecture: Let C, C C F((z))" be two codes. Then 

A(C) = A(C') =^ A(C^) = A(C'^). 

6 Open Problems 

With this paper we want to initiate an investigation of the weight distribution and weight 
preserving maps for convolutional codes. The central object of our approach is the adjacency 
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matrix of the associated state diagram. In Theorem 15 . ISI we showed that for one-dimensional 
binary codes this matrix uniquely determines the code up to monomial equivalence. One 
immediately wonders whether this is true for one-dimensional codes over arbitrary fields as 
well. From block code theory it is known that such a result, however, cannot be expected 
for higher- dimensional codes. It would be helpful to see some examples with positive overall 
constraint length. Moreover, it needs to be investigated if isometrics between convolutional 
codes can be described explicitly. Finally, of course there remains the conjecture at the end 
of the last section that the adjacency matrix of a given code determines that of the dual code. 
While for codes with overall constraint length one a transformation has been derived in 
the general case has to remain open for future research. 

Appendix 

In the following lemma we use again the notation X^^^t] •= {Xa, Xa+i, ■ ■ ■ , Xf,) for X = 
{Xi, . . . , X^) G F''' and all 1 < a < 6 < 7. For X^^^a] we write, of course, simply Xa- 

Lemma A.l Let F = F2 and 7 > 2. Furthermore, let vr : F'^ — F''' be a bijective map with 
-7r(0) = and satisfying 

7r(u,X[i ,^_i])[2^^] = 7r(X)[i.^_i] for aU X e¥'^ and aU u £¥. (A.l) 

Then tt is the identity map. 

Proof: Denote by ei, . . . ,e^ the standard basis vectors on F'''. 

1) Using X = and ti = 1 we obtain vr(ei)[2^-y] = ^(0)[i,7-i] = (0)---)0)) thus 7r(ei) = 
(a, 0, ... , 0). Bijectivity of tt implies a = 1, thus 7r(ei) = ei. 

2) Using X = e^ and u = we obtain 

^('"'-'^[1,7-1]) [2,7] = ■^(0)[2,7] = = vr(e^)[i_^_i], 
thus vr(e^) = (0, . . . , 0, a) and again bijectivity of vr implies vr(e^) = e-y. 

3) Now we proceed by induction. Assume that there is some r > 1 such that 7r(A) = X for 
all A € F''' satisfying wt(A) < r and Ai = 1. By 1) this is true for r = 1. Then we have to 
show 

(i) 7r(A) = A for ah A such that wt(A) < r, 

(ii) 7r(A) = A for ah A such that wt(A) < r + 1 and Ai = 1. 

Ad (i): Pick A G F'^' such that wt(A) < r and Ai = 1. Put A^^) = (0, A[i^^_i]). Then 
vr(A(i))[2,^] = 7r(A)[i,^_i] = X[,^^_^, thus 7c{xW) = (ai, X[,^^_^). Put now A» = 
(0, . . . , 0, A[i G F"^. We proceed by induction on i. Thus by hypothesis we may as- 
sume 

7r(A«) = (ai,...,ai,A[i,^_,]). (A.2) 
Then A(*+i) = {0,X^l^^_^^) and thus 

7r(A(*+^))[2,^] = 7r(AW)j^^^_-^j = (a,, . . . , ai, A[i^^_j_i]). 

Therefore 7r(A(*+^)) = (oj+i, . . . , ai, A[i Hence (|A.2() holds true for alH = 1, . . . , 7—1- 

Now A^"^"^) = e-y. Hence by 2) of this proof e-y = tt{X^'^~^^) = (a^-i, . . . , oi, Ai). This 
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implies oi = . . . = a^_i = and hence n^X^"^^) = X^"^^ for all z = 1, ... ,7 — 1. Since each 
X E:¥"' such that wt(X) < r is of the form X^*) for a suitable X satisfying wt(X) < r and 
Xi = 1, this proves (i). 

Ad (ii): Let X G such that Xi = 1 and wt{X) < r + 1. Then X = (l,X[i^^_i]) for some 
X E F'^ such that wt(X) = wt(X[i < r. By part (i) we know that tt{X) = X as well as 

^(0,X[i,^_i]) = (0,X[i,^_i]). (A.3) 

Now (|XT|) yields 

Hence 7r(X) = {a, X^i^^_ij) and bijectivity of vr along with ()A.3|) yields a = 1. Thus vr(X) = 

X. □ 
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